insight review articles 



Disease proteomics 

Sam Hanash 

Department of Pediatrics, University of Michigan, 1150 West Medical Center Drive, Ann Arbor, Michigan 48109, USA 
(e-mail: shanash@umich.edu) 



The sequencing of the human genome and that of numerous pathogens has opened the door for proteomics 
by providing a sequence-based framework for mining proteomes. As a result, there is intense interest in 
applying proteomics to foster a better understanding of disease processes, develop new biomarkers for 
diagnosis and early detection of disease, and accelerate drug development. This interest creates numerous 
opportunities as well as challenges to meet the needs for high sensitivity and high throughput required for 
disease-related investigations. 



Despite tremendous advances in our 
understanding of the molecular basis of 
diseases such as cancer, substantial gaps 
remain both in our understanding of 
disease pathogenesis and in the development 
of effective strategies for early diagnosis and for 
treatment. The current interest in proteomics is due in 
part to the prospects that a proteomic approach to 
disease investigations will overcome some of the 
limitations of other approaches'. The opportunities as 
well as the challenges facing disease proteomics are 
formidable. Particularly promising areas of research 
Include: delineation of altered protein expression, not 
only at the whole-cell or tissue levels, but also in 
subcellular structures, in protein complexes and in 
biological fluids; the development of novel biomarkers for 
diagnosis and early detection of disease; and the 
identification of new targets for therapeutics and the 
potential for accelerating drug development through more 
effective strategies to evaluate therapeutic effect and 
toxicity. 

The dynamic nature of the proteome of a cell or a tissue 
provides amplejustification for studying gene expression in 
disease directly at the proteomic level. But capturing this 
dynamic state represents a technological challenge. 
Undoubtedly, tackling the numerous facets of disease 
proteomics requires implementation of multiple strategies 
and technology platforms. 

Proteome profiling technologies are currently evolving 
in a manner that emphasizes the need for sensitivity and 
throughput. No one technology is likely to emerge that 
will meet the needs of all types of proteomics-based investi- 
gations, from expression proteomics to functional 
proteomics, particularly as they relate to disease. 

The use of two-dimensional gels 

During the early years of proteomics and until relatively 
recently, profiling of protein expression in disease relied 
primarily on the use of two-dimensional polyacrylamide 
gel electrophoresis (2D PAGE), which was later combined 
with mass spectrometry 2 . Most studies of this nature 
followed an approach in which a cocktail was used to 
solubilize the protein contents of an entire cell population, 
tissue or biological fluid, followed by separation of the 
protein contents of the lysate using 2D gels and visualiza- 
tion of the separated proteins using silver staining. It 
became clear that such an approach allows only a limited 
display of protein content that consisted of relatively abun- 
dant proteins. Nevertheless, profiling of disease tissues 
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using this approach has had some utility. For example, it 
was demonstrated long before the use of DNA microarrays 
that leukaemias could be classified into their different sub- 
types using 2D PAGE 3 . 

Numerous other studies have also identified disease- 
related changes in protein expression, primarily using 
2D PAGE and mass spectrometry. One such example is 
provided by studies of heart disease 4 , the spectrum of 
which encompasses a broad set of pathological conditions, 
some with acute onset of severe disease and others with 
slow, chronic progression. To assist in data gathering and 
mining, online 2D gel-derived databases of protein expres- 
sion in the myocardium for human and other species were 
constructed 5,6 . These databases have allowed investigators 
to compare data and establish reference standards. 
Relevant findings have emerged from studies of changes in 
myocardial proteins associated with human heart failure as 
well as from studies of animal models of hear t failure and of 
isolated rat myocytes 4 . Although only a small proportion of 
the proteome has been analysed, pronounced changes in 
the composition of the cardiac proteome have been found, 
affecting proteins with diverse functions. Altered overall 
levels of specific proteins or altered post-translational 
modifications of proteins such as myosin light chain 2 
have been reported in the failing heart 7 . And protein 
expression studies have uncovered proteins that exhibited 
new disease-related post-translational modifications with 
predicted functional relevance 8-11 . This level of progress 
typifies that made in profiling disease tissue in a wide 
variety of diseases. 

An important development in 2D PAGE is the use 
of immobilized pH gradients (IPGs) in which the pH 
gradient is fixed within the acrylamide matrix. IPGs also 
allow production of gels that cover a defined pH range, 
from wide to narrow 12,13 . A variation on this theme is 
the use of so-called zoom gels' in which the protein 
contents of an individual sample are first fractionated into 
narrow pH ranges under low resolution, and then 
each fraction undergoes high-resolution separation by 2D 
PAGE 14 . For example, the pre-fractionation of serum using 
this approach has enhanced the ability to detect low- 
abundance and potentially new circulating disease-marker 
proteins 14 . Yet another innovation in 2D gels Is the use of 
differential in-gel electrophoresis (DICE), in which 
two pools of proteins are labelled with different 
fluorescent dyes lS . The labelled proteins are mixed and 
separated in the same 2D gel. In one study 16 , 2D DIGE 
was applied to quantify the differences in protein expres- 
sion between oesophageal carcinoma cells and normal 
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epithelial cells. A large number of proteins were found to be either 
upregulated or downregulated in cancer cells. 

Sample fractionation prior to analysis 

Some of the main challenges facing expression proteomics, be it 
using 2D PAGE or any other approach, include the great dynamic 
rainge of protein abundance and a wide range of protein properties, 
including mass, isoelectric point, extent of hydrophobicity and 
post-translational modifications. Reducing sample complexity prior 
to analysis — for example, by analysing protein subsets and subcellu- 
lar organelles separately — improves the reach of 2D gels or other 
separation techniques for the quantitative analysis of low-abundance 
proteins. An elegant demonstration of the power of sub-proteome 
analysis is illustrated in studies of phagosomes 17 ; these have led to the 
identification of over 250 proteins from this organelle and the 
demonstration that phagosomes are formed by direct association 
and fusion of endoplasmic reticulum to the plasma membrane 
during early phagocytosis. 

The isolation of sub-proteomes may be combined with protein 
tagging to further enhance sensitivity, as in the case of surface- 



membrane proteins, a compartment rich in diagnostic and 
therapeutic targets. Protein tagging technologies are currently being 
implemented for the comprehensive analysis of the cell-surface 
proteome (Fig. 1). A surface-protein biotinylation strategy, coupled 
with the use of mass spectrometry, was applied to the gastric 
pathogen Helicobacter pylori, leading to the identification of new 
surface-membrane proteins 18 . This strategy has also led to the 
detection and identification of many new proteins on the surface of 
cancer cells (Fig. I) 19 . 

Beyond 2D gels for disease expression profiling 

Even with all the improvements that could be introduced, 2D gels will 
probably remain a rather low-throughput approach that requires a 
relatively large amount of sample. The latter is particularly problem- 
atic for clinical samples, as such samples are generally procured in 
limited amounts. Furthermore, tissue heterogeneity complicates the 
analysis of clinical samples. Various tissue microdissection 
approaches are beneficial to reduce heterogeneity, but they further 
reduce the amount of sample available. In particular, the use of 
laser-capture microdissection, which allows defined cell types to be 
isolated from tissues, yields amounts of proteins that are difficult to 
reconcile with the need for greater amounts for 2D gels 1 . 

Undoubtedly, various non-gel-based schemas that rely on liquid- 
based separations of proteins or peptides, with or without tagging, 
will have utility for disease proteomics, particularly given their 
potential for automation. Additionally, advances in microfluidic 
technology will likely allow automated separation of proteins In 
complex lysates using much reduced sample amounts. Microfluidic 
systems already have been integrated with mass spectrometry for 
protein digestion and Identification 20 . 

Non-separation-based strategies, including direct profiling using 
mass spectrometry or the use of protein microarrays, are important 
developments. Mass spectrometry has been applied to the in situ pro- 
teomic analysis of tissues, an approach that allows imaging of protein 
expression in normal and disease tissues 21 . By this method, frozen tis- 
sue is sliced and sections are applied on a matrix-assisted laser des- 
orption/ionization (MALDI) plate and analysed at regular spatial 
intervals. The mass spectra obtained at different intervals are com- 
pared, yielding a spatial distribution of Individual masses across the 
tissue section (Fig. 2) . Mass profiles of tissue sections obtained from 
normal and disease tissues may be compared to detect altered protein 
expression. Tumour analyses using this approach have uncovered 
differences in protein expression between normal and tumour tissues 
that may have specificity for different tumour types 21 . 

DNA and protein microarrays in disease investigation 
Cancer profiling using DNA microarrays 

Profiling gene expression using DNA arrays has had a tremendous 
impact on biomedical research. Disease-related applications of DNA 
microarrays include uncovering unsuspected associations between 
genes and specific clinical features of disease that are helping devise 
new molecular-based classifications of disease. In relation to cancer, 
most published studies of tumour analysis using DNA microarrays 
have examined pathologically homogeneous sets of tumours to 
identify clinically relevant subtypes (for example, responders versus 
non-responders) , pathologically distinct subtypes of tumours of the 
same lineage to identify molecular correlates (for example, high- 
stage versus low-stage tumours), or tumours of different lineages to 
identify molecular signatures for each lineage. 

Published studies of breast cancer illustrate the potential contri- 
bution of DNA microarrays to uncover new disease subtypes. In one 
study, tumours could be classified into a basal epithelial-like group, 
an ErbB2-overexpresslng group and a normal breast-like group 22 . In 
a later study, survival analyses on a sub-cohort of paUents with locally 
advanced breast cancer snowed significantly different outcomes for 
patients belonging to the various groups, despite uniform 
treatment 23 . In an Independent study of 38 invasive breast cancers, 
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striking molecular differences between ductal carcinoma specimens 
were uncovered that led to a suggested new classification for oestro- 
gen-receptor (ER) -negative breast cancer 24 . Similarly, a study of 
58 node-negative breast carcinomas discordant for ER status also 
uncovered a list of genes that discriminated tumours according to ER 
status 25 . More recently, gene expression profiling was found to be a 
more powerful predictor of disease outcome in young patients with 
cancer than clinical- and histological-based classifications 26 . 

DNA versus protein microarrays 

The DNA microarray studies described above, as well as numerous 
others in the literature, indicate the great utility of DNA microarrays 
for uncovering patterns of gene expression that are clinically infor- 
mative. An Important challenge for microarray analysis of disease tis- 
sues and cells is to understand at a mechanistic level the significance 
of associations observed between subsets of genes and clinical fea- 
tures of disease. Another challenge is to identify the smallest but most 
informative sets of genes associated with specific clinical features, 
which then can be interrogated using technologies available in clini- 
cal laboratories. Yet another challenge is to determine how well RNA 
levels of predictive genes correlate with protein levels. A lack of 
correlation may imply that the predictive property of the gene(s) is 
Independent of gene function. For example, comparisons of messen- 
ger RNA and protein levels for the same tumours reported for lung 
cancer demonstrated that only a small percentage of genes had a 
statistically significant correlation between the levels of their 
corresponding proteins and mRNAs 27 . 

Technologies for DNA microarray analysis are still evolving. 
There is a tendency by manufacturers to favour oligonucleotide- over 
complementary DNA-based microarrays, and progress has been 
made on adoption of data analysis standards 28 . Nevertheless, 
however perfected DNA microarrays and their analytical tools 
become for disease profiling, they will not eliminate a pressing need 
for other types of profiling technologies that go beyond measuring 
RNA levels, particularly for disease-related investigations. DNA 
microarrays have limited utility for the analysis of biological fluids 
and for uncovering assayable biomarkers directly in the fluid. 
Numerous alterations may occur in proteins that are not reflected in 
changes at the RNA level, providing a compelling rationale for direct 
analysis of gene expression at the protein level. As a result, there is 
substantial Interest in developing microarrays or biochips that allow 
the systematic analysis of thousands of proteins (see review in this 
issue by Fields and co-workers, page 208) . 

Unlike DNA microarrays, which provide one measure of gene 
expression (namely RNA levels) , there is a need to implement protein 
microarray strategies that address the many different features of pro- 
teins that can be altered in disease. These include, on the one hand, 
determination of their levels in biological samples and, on the other, 
determination of their selective interactions with other biomolecules, 
such as other proteins, antibodies, drugs or various small llgands. The 
compelling need for protein chips has led numerous biotechnology 
companies to devise new strategies for producing biochips that have 
utility for biomedical investigations. New classes of capture agents 
include aptamers (SomaLoglc, http://www.somalogic.com/), 
ribozymes (Archemix, http://www.archemix.com/), partial-mole- 
cule imprints (Asplra Biosystems, http://www.asplrabio.com) and 
modified binding proteins (Phylos, http://www.phylos.com). For 
assays of protein interaction, biochips that contain either peptides or 
proteins are being produced. Peptides may be synthesized in very large 
numbers directly on the chip 29 . Alternatively, recombinant proteins 
may be arrayed and effort is underway to assemble large sets of puri- 
fied recombinant proteins for microarrays and other applications. 

Profiling studies of disease tissue that have used protein micro- 
arrays are beginning to emerge. As a model to better understand how 
patterns of protein expression shape the tissue microenvironment, 
Knezevic et al. analysed protein expression in tissue derived from 
squamous cell carcinomas of the oral cavity through an antibody 
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microarray approach for high-throughput proteomlc analysis 30 . 
Using laser-capture microdissection to procure total protein from 
specific microscopic cellular populations, they showed that quantita- 
tive, and potentially qualitative, differences in expression patterns of 
multiple proteins within epithelial cells correlated reproducibly with 
oral-cavity tumour progression. Differential expression of multiple 
proteins was found in stromal cells surrounding and adjacent to 
regions of diseased epithelium that correlated directly with tumour 
progression of the epithelium. Most of the proteins identified in both 
cell types were involved in signal transduction pathways. Knezevic et 
al. hypothesized therefore that extensive molecular communications 
involving complex cellular signalling between epithelium and 
stroma play a key role in driving progression of oral-cavity cancer. 

A reverse-phase protein array approach that immobilizes the 
whole repertoire of a tissue s proteins has been developed 31 . A high 
degree of sensitivity, precision and linearity was achieved, making it 
possible to quantify the phosphorylated status of signal proteins in 
subpopulations of human tissue cells. Using this approach, Paweletz 
era/. 31 performed a longitudinal analysis of the state of pro-survival 
checkpoint proteins at the microscopic transition stage from patient- 
matched, histologically normal prostate epithelium to prostate 
intraepithelial neoplasia and to invasive prostate cancer. Cancer 
progression was associated with increased phosphorylation of the 
serine/threonine kinase Akt, suppression of apoptosls pathways, and 
decreased phosphorylation of extracellular signal-regulated kinase 
(ERK). At the transition from histologically normal epithelium to 
intraepithelial neoplasia, a statistically significant surge in 
phosphorylated Akt was observed, together with a concomitant 
suppression of downstream apoptosis pathways preceding the 
transition into invasive carcinoma. 

A clinically relevant application of protein microarrays is the 
identification of proteins that Induce an antibody response in 
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autoimmune disorders 32 . Microarrays were produced by attaching 
several hundred proteins and peptides to the surface of derivatized 
glass slides. Arrays were incubated with patient serum, and 
fluorescent labels were used to detect autoantibody binding to 
specific proteins in autoimmune diseases, including systemic lupus 
erythematosus and rheumatoid arthritis. Such microarrays 
represent a powerful tool to study immune responses in a variety of 
diseases, including cancer. 

One of the main challenges in making biochips for global analysis 
of protein expression is the current lack of comprehensive sets of 
genome-scale capture agents such as antibodies. Another important 
consideration in protein microarrays is that proteins undergo 
numerous post-translational modifications that may be crucial to 
their functions. But these modifications are generally not captured 
using either recombinant proteins or antibodies that do not 
distinctly recognize specific forms of a protein. One approach for 
comprehensive analysis of proteins in their modified forms is to array 
proteins isolated directly from cells and tissues following protein 
fractionation schemes 33 . Fractions that react with specific probes are 
within the reach of chromatographic and gel-based separation 
techniques for resolving their individual protein constituents, and of 
mass spectrometrlc techniques for identification of their constituent 
proteins. Protein microarrays of different types are likely to become 
commercially available for clinically relevant assays of broad sets of 
proteins and may well rival DNA microarrays for introduction into 
the clinical laboratory. 

The quest for disease biomarkers using proteomics 

There is substantial interest in applying proteomics to the identifica- 
tion of disease markers. Approaches include comparative analysis of 
protein expression in normal and disease tissues to identify aberrant- 
ly expressed proteins that may represent new markers, analysis of 
secreted proteins in cell lines and primary cultures, and direct serum 
protein profiling. The potential of mass spectrometry to yield 
comprehensive profiles of peptides and proteins in biological fluids 
without the need to first carry out protein separations has attracted 
interest. In principle, such an approach is highly suited for marker 
identification because of reduced sample requirements and high 
throughput. 

This approach is currently popularized, particularly for serum 
analysis, by the technology referred to as surface-enhanced laser 
desorptlon/ionization 1 . Microlltre quantities of serum from many 
samples are applied to the surface of a protein-binding plate, with 
properties to bind a class of proteins. The bound proteins are treated 
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and analysed by MALDI. The mass spectra patterns obtained for 
different samples reflect the protein and peptide contents of these 
samples. Patterns that distinguish between cancer patients and 
normal subjects with remarkable accuracy have been reported for 
several types of cancer 1 . The main drawbacks of direct analysis of 
tissues or biological fluids by MALDI are the preferential detection of 
proteins with a lower molecular mass and the difficulty in determin- 
ing the identity of proteins owing to post-translational modifications 
obscuring the correspondence of measured and predicted masses. 
Occasionally the masses observed match precisely the predicted 
masses of specific proteins. This was the case in a study of proteins 
secreted by stimulated CD8 T cells, which led to the identification of 
the small proteins a-defensin 1, 2 and 3 as contributing to the anti- 
HIV-1 activity of CD8 antiviral factor 34 . 

A productive approach for the identification of cancer markers 
has been the analysis of serum for autoantibodies against tumour 
proteins. There is increasing evidence for an Immune response to 
cancer in humans, demonstrated in part by the identification of 
autoantibodies against a number of intracellular and surface anti- 
gens detectable in sera from patients with different cancer types 35 . 
The identification of panels of tumour antigens that elicit an anti- 
body response may have utility in cancer screening, diagnosis or in 
establishing prognosis, and in immunotherapy against the disease. 

There are several approaches for the detection of tumour antigens 
that induce an immune response 35 . A number of antigens have been 
detected by screening expression libraries with patient sera 36 " 41 or, 
more recently, by using a random peptide-library approach 42 . Multi- 
ple proteins that induce autoantibodies that are specific for different 
types of cancer have been identified using 2D gels to separate tumour 
proteins, followed by western blotting and incubation with patient 
sera 43 . For most antigenic proteins identified using this approach, 
post-translational modifications contributed to the immune 
response. In a study of lung cancer, sera from 60% of patients with 
lung adenocarcinoma and from 33% of patients with squamous-cell 
lung carcinoma, but from none of the non-cancer controls, exhibited 
immunoglobulin-7-based reactivity against proteins identified as 
glycosylated annexins I and II 44 . Microarrays that contain proteins 
derived from tumour cells have the potential of substantially 
accelerating the pace of discovery of tumour antigens and yielding a 
molecular signature for immune responses directed against protein 
targets in different types of cancer 33 . 

The increased emphasis on proteomics for disease investigations 
is stimulating a reassessment of strategies for sample procurement 
and preservation to render them compatible with proteomics, 
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because of the inherent instability of proteins. For example, the man- 
ner in which biological fluids such as serum or plasma are collected Is 
not ideally suited for proteomlcs. There is a need to reduce protein 
degradation and other forms of modifications that may substantially 
alter protein content and interfere with global profiling. 

Disease-related functional proteomics 

Although data obtained by various expression proteomics strategies 
have functional relevance by uncovering altered levels or post- 
translational modification states of proteins in disease, additional 
technologies are needed for more direct functional analysis. This is 
exemplified by the need to analyse protein complexes and their 
disruption in disease, to assay in a high-throughput fashion the activ- 
ity of various classes of proteins, and to manipulate the levels and 
activities of individual proteins, in a cellular context, to determine 
their role in different biological processes and disease states. 

Various strategies are currently in use for studies of protein 
complexes and protein-protein interactions (see review In this issue 
by Fields and co-workers, page 208) . So far, such strategies have been 
applied to disease investigations on a limited basis, with a relatively 
narrow focus on particular complexes. For example, using affinity 
pull-down assays, 2D gels and mass spectrometry, myocardial 
protein kinase Ce (PKCe) was found to be associated physically with 
at least 36 other proteins with a multitude of functions. Cardiopro- 
tection induced by activation of PKCe was found to be coupled with 
dynamic modulation and recruitment of PKCe-associated 
proteins 45 . Previously unrecognized functions of PKCe have 
relevance to heart function and heart disease. Clearly, at the present 
time, most systematic studies of protein-protein Interactions have 
dealt with normal or physiological states, as the field is still in its early 
stages and the merits of various technologies, particularly for disease 
investigations, have yet to be fully appreciated. 

Likewise, there has been limited application of activity-based 
proteomics to disease investigations, although the field itself is still in 
its early stages 46-48 . Activity-based assays of normal and disease tissue 
have substantial relevance to the study of disease, but probably will be 
limited to the analysis of one class of proteins at a time. The potential 
contribution of this technique to the measurement of global dynam- 
ics in protein function is illustrated by the application of a chemical 
proteomics strategy to quantitatively compare enzyme activities 
across normal and disease tissue 47 (Fig. 3). A global analysis of the 
activity, subcellular distribution and glycosylation state for the serine 
hydrolase superfamily in a panel of human breast and melanoma cell 
lines resulted in the identification of a cluster of proteases lipases and 
esterases that distinguished cancer lines based on tissue of origin. 
Remarkably, the majority of these enzyme activities were downregu- 
lated in the most Invasive cancer lines examined, which instead 
upregulated a distinct set of secreted and membrane-associated 
enzyme activities. 

Approaches are being developed or implemented to allow direct- 
ed protein manipulations other than by decreasing or increasing 
overall amounts of protein, as can be done by gene manipulations. 
One advantage of these approaches is the ability to inactivate a specif- 
ic site in a protein in a time-dependent and localization-restricted 
manner, as with chromophore-assisted laser inactivation (CALI). 
Such inactivation allows assessment of the disease relevance of the 
site and provides a means to revert a cellular phenotype to a more 
normal state 49 (Fig. 4). Inactivation of protein targets Involved in 
different slgnal-transductlon pathways in cancer using CALI has 
been demonstrated 49 . 

Contributions of proteomics to studies of pathogens 

Despite earlier predictions to the contrary, infectious diseases remain 
as a leading cause of death worldwide. A complicating factor in 
therapy for infectious disease is the development of resistance to 
commonly used drugs (for example, as has occurred in tuberculosis) , 
which heightens the need for developing effective new therapies. 
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Interest in the application of proteomics to microbiology goes back at 
least two decades, with the pioneering work of Fred Neidhardt to 
characterize protein expression patterns in Escherichia coli under 
different growth conditions 50 . The complete sequencing of a number 
of microbial genomes has provided a framework for Identifying 
proteins encoded in these genomes using mass spectrometry. A case 
in point is the sequencing of the genome of the malaria parasite 
Plasmodium falciparum, which has provided a basis for conducting 
comparative proteomics studies of this pathogen, leading to the 
identification of new potential drug and vaccine targets 51 . Aside 
from comprehensive Identification of microbial proteins, 
proteomics is relevant to numerous aspects of microbial disease 
pathogenesis and treatment 53 " 57 (Table I) . 

Contribution of proteomics to drug development 

There is currently a burgeoning interest in proteomics on the part of 
the pharmaceutical industry, evidenced by implementation of 
proteomics programmes by most major pharmaceutical companies. 
The notion has been advanced that, as the vast majority of drugs 
target proteins, proteomics should have substantial utility for drug 
development. But the industry has so far adopted a cautious attitude, 
and it is too early to make a critical assessment of the contributions of 
proteomics to drug development, relative to other approaches. The 
caution stems from the prior heavy investment in genomics and 
other approaches and some uncertainty surrounding the 
adequacy and scalability of proteomics to meet the needs of the 
pharmaceutical industry. Provided suitable technology platforms 
become available, the use of proteomics may permeate numerous 
aspects of drug development, by identifying new targets and 
facilitating assessment of drug action and toxicity both in the 
preclinical and clinical phases. 

Several published studies illustrate the application of functional 
proteomics for Identification of regulated targets In specific 
pathways. Lewis et a/. 58 have combined functional proteomics with 
selective activation and inhibition of mitogen-activated protein 
kinase (MAPK) kinase (MKK), in order to identify cellular targets 
regulated by the MKK/ERK cascade. Twenty-five targets of this 
signalling pathway were Identified, of which only five were previously 
characterized as MKK/ERK effectors. The remaining targets suggest 
new roles for this signalling cascade in cellular processes of nuclear 
transport, nucleotide excision repair, nucleosome assembly, 
membrane transport and cy toskeletal regulation. 

In another study to identify proteases most suitable for drug 
targeting, an automated mlcrotltre-plate assay was modified to allow 
detection of the four main classes of proteases in tissue samples 
(matrix metalloproteases, cathepsins, and the cell serine proteases, 
tryptase and chy mase) 59 . Fifteen sets of colorectal carcinoma biopsies 
representing primary tumour, adjacent normal colon and liver 
metastases were screened for protease activity. Matrix metallopro- 
teases were expressed at higher levels in the primary tumour than in 
adjacent normal tissue. The mast cell proteases, in contrast, were 
found at very high levels in adjacent normal tissue, but were not 
detectable in the metastases. Cathepsin B activity was significantly 
higher in the primary tumour, and highest in the metastases. The 
proteases detected by activity assays were then localized in biopsy 
sections by lmmunohistochemlstry. Mast cell proteases were abun- 
dant in adjacent normal tissue, because of Infiltration of the lamina 
propria by mast cells. Matrix metalloproteases were localized to the 
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tumour cells themselves, whereas cathepsin B was expressed pre- 
dominantly by macrophages at the leading edge of invading tumours. 

Such activity-based screening provides a basis for selecting targets 
in the development of inhibitors to specific proteases. Protein 
biochips potentially could provide a high-throughput platform for 
target identification. Biochips developed for Interaction studies 
could be important in lead-compound optimization and could 
accelerate drug development by allowing efficient evaluation of lead 
compounds for specificity and selectivity in binding to drug targets. 
Proteomics also may provide increased efficiency of clinical trials 
through the availability of biologically relevant markers for drug 
efficacy and safety. 

Organizing proteomics initiatives 

It is clear that while some progress has been made in disease 
proteomics, the field Is still in its infancy. Knowledge of the sequence 
of the human genome has provided a framework for genomic 
approaches to unravel disease processes. A similar knowledge of the 
human proteome is currently lacking. Developing a comprehensive 
knowledge framework of the proteome is considerably more 
complex than sequencing the human genome. Ideally, such a 
proteome framework would encompass knowledge of all human 
proteins, from their sequence to their post-translational modifica- 
tions, to their interactions among each other, their cellular and sub- 
cellular distribution, and their temporal pattern of expression. 
Although such an exhaustive framework will not materialize in the 
foreseeable future, more modest goals may well be within reach. 

To that effect, there is a need to begin an organized effort, the goals 
of which include developing an infrastructure in proteomics that 
would substantially facilitate unravelling the complexity of the pro- 
teome in health and in disease. The Human Proteome Organisation 
(HUPO, http://www.hupo.org) was founded to regroup scientists In 
the public and private sectors engaged throughout the world In 



various aspects of proteomics. HUPOs mission is threefold: to 
consolidate national and regional proteome organizations into a 
worldwide organization; to engage in scientific and educational 
activities to encourage the spread of proteomics technologies and 
disseminate knowledge pertaining to the human proteome and that 
of model organisms; and to assist in the coordination of public 
proteome Initiatives aimed at characterizing specific tissue and cell 
proteomes. Initiatives currently In the pilot phase include an 
international effort to Identify proteins detectable in normal serum 
and plasma and their range of variation with age, ethnicity and 
physiological state, and a liver proteome study to identify proteins 
expressed in the liver. These initiatives have attracted substantial 
interest and will be integrated with efforts in protein Informatics to 
achieve data standardization on the one hand, and data curation on 
the other. 

Concluding remarks 

Proteome alterations in disease may occur in many different ways 
that are not predictable from genomic analysis, and it is clear that a 
better understanding of these alterations will have a substantial 
Impact in medicine. A useful repertoire of proteomics technologies is 
currently available for disease-related applications, although 
further technological innovations would be beneficial to increase 
sensitivity, reduce sample requirement, increase throughput and 
more effectively uncover various types of protein alterations such as 
post-translational modifications. The use of these technologies 
will likely expand substantially, particularly to meet the need for 
better diagnostics and to shorten the path for developing effective 
therapy. □ 
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